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Abstract 

Various new nonembeddability results (mainly into Li) are proved via Fourier analysis. In 
particular, it is shown that the Edit Distance on {0,1}'' has Li distortion (log(i)3^°(^^ We 
also give new lower bounds on the Li distortion of flat tori, quotients of the discrete hypercube 
under group actions, and the transportation cost (Earthmover) metric. 

1 Introduction 

The bi-Lipschitz theory of metric spaces has witnessed a surge of activity in the past four decades. 
While the original motivation for this type of investigation came from metric geometry and Banach 
space theory, since the mid-1990s it has become increasingly clear that understanding metric spaces 
in the bi-Lipschitz category is intimately related to fundamental algorithmic questions arising in 
theoretical computer science. Despite the remarkable list of achievements of this field, which 
includes the best known approximation algorithms for a a wide range of NP hard problems, the bi- 
Lipschitz theory is still in its infancy. In particular, there are very few known methods for proving 
nonembeddability results. The purpose of this paper is to the develop a Fourier-analytic approach 
to proving nonembeddability theorems. In doing so, we resolve several problems, and shed new 
light on existing results. Additionally, our work points toward several interesting directions for 
future research, with emphasis on the study of the bi-Lipschitz structure of quotients of metric 
spaces. 

Let {X, dx) and (Y, dy) be metric spaces. The Lipschitz constant of a function f : X ^ Y is 



Lip := sup 



dYifix)Jiy)) 



x,yex dx{x,y) 

If / is one-to-one then its distortion is defined as 

dist(/) := ll/llLip-lir^Lip. 

If / is not one-to-one then we set dist(/) = oo. The least distortion with which X can be embedded 
into Y is denoted cy(X), namely 

cy{X) := inf{dist(/) : f:X^Y]. 
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We are particularly interested in embeddings into Lp spaces. In this case we write Cp{X) = 
cl^[X). The most studied type of embeddings are into Hilbert space, in which case the parameter 
C2{X) is known as the Euclidean distortion of X. The parameter ci{X), i.e. the least distortion 
required to embed X into Li, is of great algorithmic significance, especially in the study of cut 
problems in graphs. The Euclidean distortion of a metric space X is relatively well understood: 
it is enough to understand the distortion of finite subsets of X, and for finite metrics there is a 
simple semidefinite program which computes their Euclidean distortion [lUI- Embeddings into Li 
are much more mysterious (see |39p. and there are very few known methods to bound ci{X) from 
below. 

The present paper contains several new nonembeddability results, which we now describe. The 
common theme is that our proofs are based on analytic methods, most notably Fourier analysis on 
{0, l}*^ and M". We stress that this is not the first time that nonembeddability results have drawn 
on techniques from harmonic analysis. Indeed, the proofs of results in |171 1561 1521 1351 146j all have 
a Fourier analytic component. 

Our results. 

1) Quotients of the discrete hypercube and transportation cost. A classical theorem of 
Banach states that every separable Banach space is a quotient of ii. More precisely, for every 
separable Banach space X, there is a linear subspace Y such that ii/Y is linearly isometric 
to X. This suggests that interesting "bad examples" of metric spaces can be obtained as metric 
quotients of the Hamming cube. Roughly speaking, this says that we can obtain interesting metrics 
(i.e. metrics that do not embed into nice spaces, say Li) by identifying points of the hypercube. 
Quotients of metric spaces are a well studied concept (see j^Tl for an introduction, and [32] 
for a discussion of quotients of finite metric spaces)- we refer the reader to Section |31 for a precise 
definition of this notion. 

Motivated by this analogy, in Section 01 we exhibit classes of quotients of the Hamming cube 
which do not embed into Li. A fundamental theorem of Bourgain JH] states that for every finite 
metric space X, ci{X) < C2{X) = 0(log \X\). In [T^ Bourgain used a counting argument to show 
that there exist arbitrarily large metric spaces X with C2{X) = 0(log |X|/loglog \ X\). In j4()l in] it 
was shown that there exit arbitrarily large metric spaces X with ci{X) = O(log |X|) (namely X can 
be taken to be a constant degree expander). In Section |31 we show that there exist simple n-point 
quotients of the Hamming cube {0, l}'^ which incur distortion Q(logn) in any Li embedding. We 
also show that certain quotients of the Hamming cube obtained from the action of a transitive 
permutation group of the coordinates do not well-embed into Li. These results are proved via a 
flexible Fourier analytic approach. 

As an application of the results stated above we settle the problem of the Li embeddability of the 
transportation cost metric (also known as the Earthmover metric in the computer vision/graphics 
literature) on the set of all probability measures on {0, l}"^. Denoting by P({0, l}'^) the space of 
all probability measures on the Hamming cube {0, 1}"^, let Tp{a,T) denote the transportation cost 
distance between ct, r G P({0, l}*^), with respect to the cost function induced by the Hamming 
metric p (see Section 13.21 for the deflnition) . Such metrics occur in various contexts in computer 
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science: they are a popular distance measure in graphics and vision j^HEl], and they are used 
as LP relaxations for classification problems such as 0-extension and metric labelling [231 1211 12] . 
Transportation cost metrics are also prevalent in several areas of analysis and PDEs (see the 
book and the references therein). 

Motivated by applications to nearest neighbor search (a.k.a. similarity search in the vision 
literature), the problem of embedding transportation cost metrics into Li attracted a lot of atten- 
tion in recent years (see jnHEllllI)- In HH OH it is shown that ci{V{{0,lY),Tp) = 0{d). In 
Section 13.21 we show that this bound is optimal, i.e. ci {V{{Q,lY),Tp) = n{d). From an analytic 
perspective, Kantorovich duality (see [Snj) implies that (^({0, 1}°'), 7^) embeds isometrically into 
Lip({0, l}*^)*- the dual of the Banach space of all real valued Lipschitz functions on the hypercube. 
A result of Bourgain [2] implies that sup^^gpj ci(Lip({0, 1}"')*) = oo. Our result shows that in fact 
ci(Lip({0, 1}^)*) = ^{d), improving upon the lower bound obtained in |14j . 

2) Edit Distance does not embed into Li. Edit Distance (also known as Levenstein dis- 
tance |38j ) is a metric defined on the set of all finite-length binary strings, which we denote {0, 1}*. 
This metric is best viewed as the shortest path metric on the following infinite graph : Let G 
be a graph with set of vertices {0, 1}*, and {x,y} is an edge of the graph if the string y can be 
obtained from string x by either deleting one character from x or by inserting one character into 
X. For strings x,y, denote the shortest path distance in G (i.e. the Edit Distance) between x,y as 
ED(x, y). In words, ED(2;, y) is the minimum number of edit operations needed to transform x into 
y. Here we assume that only insertion/deletion operations are allowed. Character substitution can 
be simulated by a deletion followed by an insertion. Similarly, one can shift a string by deleting its 
first character and inserting it at the end. 

Edit Distance is a very useful metric arising in several applications, most notably in string and 
text comparison problems, which are prevalent in computer science (e.g. compression and pattern 
matching) , computational biology, and web searching (see the papers [23 ^ |^ El 12111 ^^'^ 
the references therein, and the book jSUl for a discussion of applications to computational biology). 

Let ({0, 1}'^,ED) denote the space {0, l}'^ with the Edit Distance metric (inherited from the 
metric ED on {0, 1}*). A well known problem, stated e.g. in is whether the space ({0, l}*^, ED) 
embeds into Li with uniformly bounded distortion. Had this been true, it would have had significant 
applications in computer science (see 011). Most notably it would lead to approximate nearest 
neighbor search algorithms under Edit Distance, and to efficient algorithms for computing the Edit 
Distance between two strings (both of these problems are being solved, by computational biologists, 
every day, hundreds of times. Getting a substantially faster algorithm for any of them would be 
of great practical importance). In Section 01 we show that the Li embedding approach fails, by 
proving via Fourier analytic methods that 

ci({0,l}^ED)>^-^^^^. 

^ - 20(Vlog logo! log log log d) 

The previous best known lower bound is due to P, where it is shown that ci({0, 1}'^,ED) > 3/2. 
The best known upper bound on ci({0, 1}'^,ED) is due to ISlj, where it is proved that 

ci({0,l}^ED) < 2^(^'°s^^°s'°srf). 
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3) Flat tori can be highly non-Euclidean. The Nash embedding theorem states that any 
n-dimensional Riemannian manifold is isometric to a Riemannian sub- manifold of M?^. In the bi- 
Lipschitz category this is no longer the case- it is easy to construct Riemannian manifolds (indeed, 
even Riemannian surfaces) which do not embed bi-Lipschitzly even into infinite dimensional Hilbert 
space. However, all the known constructions were highly curved, and the possibility remained that 
any manifold with zero curvature embeds bi-Lipschitzly into L2, with a uniform bound on the 
distortion. In Section [S] we show that this isn't the case: there is an n-dimensional flat torus, i.e. 
M"/A for some lattice A C M", equipped with the natural Riemannian metric (whose sectional 
curvature is identically 0), such that ci(M"'/A) = Q,{^/n). This result answers the question, posed 
by W. B. Johnson, whether a Lipschitz quotient (in the sense of [7j) of Hilbert space embeds bi- 
Lipschitzly into Hilbert space. In 7 it is shown that a Banach space which is a Lipschitz quotient 
of a Hilbert space is isomorphic to a Hilbert space. Johnson's question is whether the condition 
that the quotient is a Banach space is necessary. Since the natural quotient map vr : M" M^/A is 
a Lipschitz quotient (see Sectional), the above example shows that Lipschitz quotients of Hilbert 
space need not embed into Hilbert space (indeed, they may not embed even into Li). Our approach 
is a variant of our study of quotient metrics in Section |31 and the proof is based on Fourier analysis 
over M", instead of discrete Fourier analysis over {0, 1}". 

This paper is organized as follows. In Section [2 we present some background and preliminary 
results on Fourier analysis on the Hamming cube. In section 13] we investigate quotients of the 
hypercube under group actions. In Section |1] we prove our lower bound on the Li distortion of 
Edit Distance, and in Section [21 we discuss the Li and L2 embeddability of fiat tori. We end with 
Section El which contains a brief discussion which relates the notion of length of metric spaces (first 
introduced by Schechtman [SH] in the context of the concentration of measure phenomenon) to 
nonembeddability results. This gives, in particular, new lower bounds on the Euclidean distortion 
of various groups equipped with a group invariant metric. 

2 Preliminaries on Fourier analysis on the hypercube 

We start by introducing some notation concerning Fourier analysis on the group = {0,1}'^. For 
e G (0, 1) we denote by fis the product e-biased measure on F2, i.e. the measure given by 

V X G F^, /x,({x}) = e^U (1 - e f-^U . 

For the sake of simplicity we write fj, = /ii/2- Given A C {1, . . . ,d} we define the Walsh function 

Wa{x) = (-l)^^-6-4'=^ 

Then {Wa ■ A C {1, . . . , d}} is an orthonormal basis of L2(F2, /u). In particular any / : — > ^2 
has a unique Fourier expansion 

AC{l,...,d} 
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where 

f{A) = I f{x)WA{x)dfi{x) G L2, 



and Parseval's identity reads as 

2 AC{l,...,d} 

Let Cj G F2 be the vector whose only non-zero coordinate is the jth coordinate. We also write 
e := ei + . . . + for the all Is vector. The partial differentiation operator on L2(F2) is defined by 

Since for every A C {1, . . . , d} we have that 



djWA 

we see that for every / : F2 ^ M 



-Wa jeA 
j^A, 



J2 I d,f{xfdn{x) = Wi^f- (1) 

— Iv^ — 

i=i^2 Ac|i,...,d| 



In what follows we denote by p the Hamming metric on F2, namely for x,y G Fg, 

P{x,y) ■■= \{j G {1, . . . ,4 : / yj]\ 
Observe that for every / : F2 — > M, 



\f{x)-f[x + e)\'d^ji[x) = 

id 

2 AC 

|A| = 1 mod 2 



^ 4/(A)2<4 \mAf = AYfi9^fi^)]'Mx)- 

— — — lw<i 



This famous inequality, first proved by Enflo in [26| via a geometric argument, implies that C2(F2) > 
Indeed, by integration we see that for every f : ¥2 ^ L2, 

d „ 

\\f{x)-f{x + e)\\ld^i{x)<AY / \\d,f{x)\\ld^i{x). 
Thus, assuming that / is invertible we see that 

J2 /„.„.. X2 



11/ Mlup 

i.e. 

ii/iiLip-iir'iiLip>^^. 

This Fourier-analytic approach to Enflo's theorem motivates the ensuing arguments in this paper, 
since it turns out to be remarkably flexible. For future reference we record here the basic Poincare 
inequality implied by the above reasoning: 
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Lemma 2.1. For every f -.¥2 ^ L2, 



I \\f{x)-f{y)\\ld^i{x)d,,{y) 



2 



/ \\^,f{x)\\ld^,{x) 



< 



mm{\A\ : /(A) / 0} 



Proof. We simply observe that 




□ 



3 Quotients of the hypercube 

Let (X, dx) be a metric space. For A,B <^ X the Hausdorff distance between A, B is defined as 



Following j^ZlEllini! given a partition U = {C/i, . . . , [7,^} of X, we define the quotient metric 
induced by X on denoted X/^, as follows: assign to each i,j G {!,..., A;} the weight = 
dx{Ui,Uj) = minxi^Ui, y<^Uj dx{x,y), and let dx/uiUi,Uj) be the shortest path distance between i 
and j in the weighted complete graph on {1, . . . ,k} in which the edge {i,j} has weight Wij. 

In the following lemma the right-hand inequality is an immediate consequence of 0, and the 
left-hand inequality follows from the fact that the Hausdorff distance is a metric on subsets of X. 

Lemma 3.1. Assume that U = {Ui, . . . , C/fc} is a partition of a metric space X such that for every 
i,j E {1, . . . , k}, for every x £ Ui there exists y £ Uj such that dx{x, y) = dx{Ui, Uj). Then for 
every i,j G {1, . . .,k}, 



A particular case of interest is when a group G acts on X by isometries. In this case the orbit 
partition induced by G on X clearly satisfies the conditions of Lemma 13.11 implying that for all 
x,y e X, 



where we slightly abuse notation by letting X/ G be the quotient of X induced by the orbits of G. 
This is the only type of quotients that we study in this paper. In particular, Lemma IXD implies 
that the quotients we study here are also Lipschitz quotients in the sense of [Jj (see Section 6 in |45j 
for an explanation). 

We will require the following lower bound on the average distance in quotients of the hypercube. 
Lemma 3.2. Let G be a group of isometries acting on with 2 < |G| < 2^^. Then 



HxiA, B) = sup {max{dx {a, B),dx{b, A)} : a e A, b e B} . 



(2) 



dx/u{Ui, Uj) = HxiUi, Uj) = dxiU, Uj). 



dx/G{Gx,Gy) = dx{Gx,Gy) 




6 



Proof. For every t > 0, 

fix fi{x,ye¥i: p(Gx, Gy) > t} > 1 - J] ^ x ^{x, y G yy) < t} = 1 - ^ • ^ T^V 

geG k<t ^ ^ 

We shall use the following (rough) bounds, which are a simple consequence of Stirling's formula: 
For every l/d < 5 < 1/2, 



Thus, using Lemma l3?T] we get that 

/ P^.,g(G^^ Gy)d^i{x)d^i{y) >5d(l- ^^2^d ■ [5\l - 5f 

Choosing 5 = Q \ '^-i°g2 l^^l ^ ] yields the required result. □ 

3.1 A simple construction of n-point spaces with ci = f2(logn) 

In what follows we refer to |411lllj for the necessary background on coding theory. Let C C {0, l}'^ 
be a code, i.e. a linear subspace of Fj. Denote by w{C) the minimum Hamming weight of nonzero 
elements of C, i.e. 

wiC) = min 

a;6C\{0} 

We also use the standard notation 

:= |x G : ^ y e C, {x, y) = mod 2} , 

where (x,y) := YJj=iXjyj- 

Lemma 3.3. Assume that f : ¥2 ^ L2 satisfies for every x € Fg and y G C"*", f{x + y) = f{x). 
Then for every nonempty A C {1, . . . , d} with \A\ < w{C), f{A) = 0. 

Proof. Since (C-*-)-*- = C (see ^2), 1^ ^ (C-*-)-*-, implying that there exists v G C"*- such that 
= 1 mod 2. Now, 



/(A) = / f{x)WA{x)d^x{x) 

f{x + v)WA{x)d^M{x) 
f{x)WA{x — v)dn{x) 

= (_i)(iA,.> f f(^x)WA{x)dfi{x) 
Jf^ 

= -m- 

So f{A) = 0. □ 
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Theorem 3.4. Let C <^¥^ be a code. Then 



d + d log 



d 

dim{C) 



Proof. Let / : ¥^/C^ ^ Li be a bijection. Define / : ^ Li by f{x) = f{x + C^). It is well 
known j25j that there exists a mapping T : Li — > L2 such that for all x, y G Li, 

\\T{x)-T{y)h = ^J\\x-y\\^. 

Define /i : F2 — > i>y h = T o f. By Lemma 13.31 and Lemma l2. II we get that 

/ \\f{x) - f{y)\\idis{x)dis{y) = [ WHx) - h{y)\\ldfiix)dfiiy) 



L 2 2 

d 



< 



^ ^ / \\djh{x)gd^,{, 



d „ 

^ / \\dj{x)hdf,{ 



w{C) 



3 

d 



< ^ll/llL,p. (4) 



On the other hand, by Lemma 13.21 we see that 



\\f{x)- f{y)\\id,,{x)d,,{y) = / \\f{x + C^)-f{y + C^)hd^x{x)d^x{y) 



> 



111 / Pw^/c^i^ + C ,y + C )dfi{x)dfi{y) 

iLip Jf^xF^ 



n 



d-logolC^ 



^1 a |-p^' 



dim{C) ^ 

where we used the fact that \C^\ = 2'^~^''"^^\ 

Combining @ and © yields the required result. □ 

Corollary 3.5. There exists arbitrarily large finite metric spaces X for which ci{X) = rJ(log \X\). 

Proof Let C C {0, l}'^ be a code with dim(C) > f and w{C) = n{d). Such codes are well known to 
exist (see |41|). and are easy to obtain via the following greedy construction: fix < d/A and let V 
he a k dimensional subspace of with w{V) > 6d. Then V contains 2'^ points. The number vectors 
X e F^ with ||x + t;||i < 6d for some u G F is at most 2^ J2e<5d Ce) < 2*^+^^/^ • [5\l - 5)^-^]''^. 
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It follows that there exists 6 = $1(1) such that for every k < d/A there exists x S such that 
w{spa.n{V U {x})) > 6d, as required. Now, for C as above, Theorem 13.41 implies that 

□ 

Remark 3.1. Using the Matousek's extrapolation lemma for Poincare inequalities (see also 
Lemma 5.5 in 6 ), it is possible to prove that for a code C as in Corollarv 13.51 for every p > 1, 
Cp{¥i/C^) > c{p)d. 

3.2 The relation to transportation cost 

Given a finite metric space {X, d) we denote by 'P{X) the set of all probability measures on X. For 
a,T £ V{X) we define 

U{a,T) = i^TT eViX X X) : \fx e X, J dTT{x,y) = a{x), and J (i7r(y, x) = r(x)| 

The optimal transportation cost (with respect to the metric d) between a and r is defined as 



Td{a,T) = inf / d{x,y)d-K{x,y). 

7r6n(o-,T) JxxX 



Given ^ C X we denote by fiA £ ^(^) the uniform probability measure on A. A, B X have 
the same cardinality then a straightforward extreme point argument (see fSQi) shows that 

Xi{^A, fJ-B) = inf I J d{a, f{a))dfiA '■ f '■ ^ ^ B isl — 1 and onto| . 

Lemma 3.6. Let G be a finite group, equipped with a group invariant metric d (i.e. d{xg,yg) = 
d{x, y) for all g,x,y £ G). Then for every subgroup H (1 G and x,y G G, 

dG/H{xH,yH) = %i{fJ.xH,f^yH)- 
Proof. For every bijection / : xH yH, 

d{g,f{g))dfixH{g) > d{xH,yH) = dG/H(.xH,yH). 

xH 

On the other hand, fix /ii,/i2 G H such that d{xhi,yh2) = d{xH,yH). Then the mapping 
/ : xH yH given by f{g) = yh2h^^x^^g satisfies for all g S xH, d{g,f{g)) = d{xhi,yh2) = 
d{xH,yH), implying the required result. □ 

Corollary 3.7. It follows from Corollary\^ and Lemma that ci{V{¥^),Tp) = n{d). This 
matches the upper bound proved in \21l In fact, from Remark \3.1\ we see that for all p > I, 
Cp{V{¥i),Tp)>c{p)d. 

Remark 3.2. Let Lip(F2) be the Banach space of all functions / : F2 ^ M satisfying /(O) = 0, 
equipped with the Lipschitz norm || • ||Lip. By Kantorovich duality (see [SD]), ('^(^^'2)' i^ isometric 
to a subset of the dual space Lip(F^)*. It follows that ci(Lip(F^)*) = n{d). As remarked in the 
introduction, the fact that sup^^gj^ ci(Lip(F2)*) = cxd was first proved by Bourgain 14 using a 
different argument (which yields a worse lower bound on the distortion). 
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3.3 Actions of transitive permutation groups 

Let G < 5^ be a subgroup of the symmetric group. Clearly G acts by isometries on via 
permutations of the coordinates. 

Theorem 3.8. Let / : F2 — > -Li he a G-invariant function, where G is transitive. Then 
I \\f{x) - f{y)hd^i{x)d^i{y) <-^Y, [ \\d,f{x)hdf^{x). 

Proof. Let A C F2 be a G invariant subset of the hypercube and write ij.{A) = p. For f = 1a the 
required inequality becomes: 

in 

where Ij{A) = fi{x £ Fg : + ej} n A\ = 1}| is the influence of the jth variable on A. 

By |33j . maxKj^d Ij (A) > -^(1 — p). But, since A is invariant under the action of a transitive 
permutation group, Ij{A) is independent of j, so © does indeed hold true. 

In the general case let / : — > -Li be a G invariant function. Denote by vr : Fg ^ F2/G the 
natural quotient map, i.e. tt{x) = Gx. Since / is G-invariant, there is a function h : F2/G Li 
such that f = hoTT. By the cut-cone representation of Li metrics (see there are nonnegative 
weights {^a}ac¥^/g such that for every x,y G F2, 

\\fix)-f{y)h = ||Mvr(x))-/^(vr(y))||i 

= Yl AA|lA(vr(x))-l^(7r(y))| 

AC¥^/G 

= E Xa\K~i{a){x) - K-i{A){y)\- 

AC¥^/G 



Observe that for every A C F2/G, vr ^{A) C F2 is G-invariant. Thus by the above reasoning 




dfi{x) 



□ 



10 



We thus get many examples of spaces which do not well-embed into Li: 
Corollary 3.9. Let G be a transitive permutation group with \G\ < 2^^, for some e G (0, 1). Then 

Proof. This is a direct consequence of Theorem 13.81 and lemma 13.21 □ 

Remark 3.3. It is possible to obtain slightly stronger results analogous to Corollary when we 
have additional information on the structure of the group G. Indeed, in this case, in the proof of 
Theorem 13.81 one can use the results of Bourgain and Kalai ^Hl on the influence of variables on 
group invariant Boolean functions, instead of using j33| . 

Remark 3.4. We do not know if (F2, IHb)/^ embeds bi-Lipschitzly in Hilbert space with uniformly 
bounded distortion. This seems to be unknown even in the case when G is generated by the cyclic 
shift of the coordinates. This problem is interesting since if this space does embed into Hilbert 
space, then the results of this section will yield an alternative approach to the recent solution of 
the Goemans-Linial conjecture in j35j . 



4 Edit Distance does not embed into Li 

In this section we settle the Li embeddability problem of Edit Distance negatively, by proving the 
following theorem: 

Theorem 4.1. The following lower bound holds true: 

ci(F^,ED) > 



O ( Vlog log d log log log 



The following lemma is a useful way to prove Li nonembeddability results. The case 5 = 
of this lemma is due to ,40;. Variants of the case 6 > 0, which is the case used in our proof 
of Theorem 14. H seem to be folklore. We include here the formulation we need for the sake of 
completeness (the main part of the proof below is a variant of the proof of Lemma 3.6 in ^^T|). 

Lemma 4.2. Fix a > and < 6 < ^. Let {X, d) be a finite metric space, a a probability measure 
on X , and r a probability measure on X x X . Assume that for every A Q X with 6 < o'{A) < | 
we have that r({(x, y) € X x X : y} n ^| = 1}) > aa{A). Then, 

c^(X) > - -^-y XX djx, y) da{x)da{y) - 25 diam(X) 
^ ~ 2 J^^^d{x,y)dT{x,y) 

Proof. We claim that there exists a subset y C X with cr{Y) > 1 — 5 such that for every f : Y ^ Li, 
\\f{x)-f{y)\\,da{x)da{y) < - f ||/(x) - /(y)||idr(x, y). (7) 

YxY CH JyxY 
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This will imply the required lower bound on ci{X) since if / : X — > Li is a bijection then 



||/(x)-/(y)||idr(x,y) < ||/||Lip / y) dr(x, y), 

YxY JXxX 



while 



\\fix)-fiy)\\idaix)da{y) > / y) 

YxY 11/ IlLip \JxxX 

d{x, y) da{x)da{y) 
XxiX\Y) J 



- Ilf-l 



J-. ( / d{x,y) da{x)da{y) — 25diam(X) j 

IlLip \JxxX J 



It remains to prove the existence of the required subset Y. For simplicity we denote for every 
A,B CX, 

{A,B} = e X x X : {x, y} n A / A {x, y} n B / 0}. 

Define inductively disjoint subsets = Wq, Wi, . . . , Wk C X as follows. Having defined Wi, . . . ,Wi, 
write Yi = D^^^^Wi and let Wj+i C X \ 1^ be an arbitrary nonempty subset for which 

t{{W^+i,X\ (y, U Wi+i)}) < aa{Wi+i) < ^a{X \ Yi). 

If no such Wj exists then this process terminates. We claim that cT(Yfc) < 5. Indeed, otherwise let 
j be the first time at which cr(Yj) > 6. Observe that 

By our assumptions it follows that T{{Yj,X\Yj}) > aa(Yj). But from the following simple inclusion 

{j j "j j 

\JW„X\\JwA O \J{W,,X\iYi^iUWi)} 
i=l i=l ) i=l 

we deduce that 

i 3 
< aa{Yj) < Ti{Yj,X\ Yj}) < Y,r{{Wi,X\ (K,_i U W,)}) < ^aaiWi) = aaiYj), 

1=1 i=l 

a contradiction. Thus, taking y = we see that for every A (1 Y with (j{A) < ^ we have 
t{{A, Y \ A}) > aa{A). In other words, 



[ ||lA(x)-lA(y)||idT(x,y) = t{{A,Y\A}) 
Jyxy 

> aa{A) 



> aa{A)[a{Y) - a{A)] 

= f / ||/(x)-/(y)||ida(x)da(y), 

^ JYxY 

which implies (jT)) by the cut cone representation of Li metrics (as in the proof of Theorem 13. 8(1 . □ 
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In what follows we let S denote the cyclic shift operator on Fg, namely 

S{xi, ...,Xd) = {Xd,Xl,X2, . . .,Xd-l). 

We will also use the following remarkable theorem of Bourgain [TH]. The exact dependence on the 
parameters below is not stated explicitly in but it follows from the proof of jl5j . Since such 
quantitative estimates are useful in several contexts, for future reference we reproduce Bourgain's 
argument in Section [3 while tracking the bounds that he obtains. 

Theorem 4.3 (Bourgain's noise sensitivity theorem [155). Fixe, 6 S (0, 1/10) and a Boolean 
function / : ^ {-1,1}- ^ 

ssume that 

^ il-e)\^\fiAf>l-6. 

AC{l,...,d} 

Then for every j3 > there exists a function (7 : F2 ^ K which depends on at most coordinates, 
such that 

[ (fix)- g{x)fdfi{x) < 2Vi°g(V'5)iogiog{i/e) . (_L + 4ye^ 
hi Vve 

Here c is a universal constant. 

Lemma 4.4. There exists a universal constant C > such that for every e £ (0,1/10), every 
integer k > lO"^^^^ , and every / : Fg ^ 1}; 

\f{x) - f{y)\dfi{x)dfi{y) - 2-Vi°s{i/^)iogiog{i/.) 



< rr E/ \fix)-fiS^{x)+y)\df,ix)df,M- (8) 



2CVlog(l/£)loglog(l/£) ^ 

j = l •"^2'^'^2 

Proof. Observe that for every x, y G Fg, \ f{x) — f{y)\ = 1 — f{x)f{y). Thus for every j G {1, . . . , d} 
/ \f{x) - fiSHx) + y)W{x)d^iM = 1 - / f{x)f{S^{x) + y)df,{x)df,M 

Jf^xF^ JFfxFf 

fiA)f{B)WA{x)WB{S^{x) + y) j d^,{x)d^^,{y) 

^A,_BC{l,...,d} / 

Y J{A)f{B)WA{x)Ws-.i^B){x)WB{y) I dii{x)diiM 



= 1 


-/ 




JF^xF^ 


= 1 


-/ 




JF^xF^ 


= 1 


- E 




AQ{1,... 


> 1 


- E 



l-2er\f{A)\ (9) 

AC{l,...,d} 
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where we used the Cauchy-Schwartz inequahty and the facts that for all B C {1,. . . ,d} we have 
Jw^^WB{y)d^ie{y) = (1 - 2e)l^l and f^,WAWs-,,^B)df^ = when B / S^{A). Averaging ® over 
j = 1, . . . ,k we see that 



IY. f \m-f{S^x)+y)\df^{x)df^s{y)>l- E (l-2e)l^l/(^)'. 



Thus, in order to prove ((SJ we may assume that 

Y: (l-2e)l^l/(^)2>l-— _^_=, (10) 

AC{l,...,d} 

where c > 2 is as in Theorem 14.31 Now, inequality (|1U() . together with Theorem 14.31 fwith f3 = 
lg"i/e) implies that there exists an integer t < 20^^^, a function g : ¥\ ^ { — 1,1}, and indices 
1 < ii < 12 < ■ ■ ■ < it ^ d such that if we extend g to a function g -.¥2 ^ 1} by setting 

g{xi , . . . , Xd) = g{xi^ , Xi2 , • • • , Xi^ ) , 

then 



- < / {fix) - g{x)f dii{x) < 2-Viog(Ve)iogiog(i/e)_ 



Write / = {ii , . . . , it} and for j G {1, . . . , d} define I + j = {ii + j mod d, . . . ,it + j mod d}. If 
/ n (/ + j) = then we have the identity 

/ laix) - g{S^{x) + y)\dfi{x)dfieiy) = / \gix) - g{y)\d^{x)d^{y). 

Assume that k > 2t^. In this case 

\{je{l,...,k]: in{i + j)=(l>}\>k-t^>^. 

Thus 

\ ^ 1 

7 / 15(3;) - g{S^{x) + y)\dfi{x)dfis{y) > o / 15(2;) - g{y)\d^{x)d^{y). 

j=l J^i>^^i ^ JFf xF| 

It follows that 

r E / - /(^'(^) + V)W{x)di,,{y) >\\ \f{x) - f{y)mx)df,iy) - 

j=l -^PfxPi JFf xFf 

3 / \f{x)-g{x)\di,{x) 



^ ji 2 

> ^/ - f{y)mx)d^,{y) - 3 • 2-Vi°g{iA)iogiog{i/.)_ 

^ JF^xF^ 



This completes the proof of ((SJ. □ 
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We require the following rough bound on the average edit distance on Fg. Such simple estimates 
have been previously obtained by several authors, see for example Lemma 8 in |Hj. 

Lemma 4.5. We have the following lower bound on the average Edit Distance on Fg; 

ED{x, y)dfi{x)dfi{y) > 

Proof. For every x G F2 and every integer r < d/2, 

\{ye¥i: ED(x,y)=r}|<2^p^ 

This is best seen by observing that any sequence of r insertions or deletions can be executed in 
a sorted order, that is, the indices of positions on which the operation is performed increases. 
There are at most (^^^^ ways to choose the r locations of these edit operations, and 2^ possible 
insertion/deletion operations on these r locations. 
Now, 

r 



^x/i({(x,y) GF^xF^ ED(x, y) > d/16}) > 1 - 1 2^ (^^^ 



> 1 



r<d/ie 

- . 2'^/8 . 2i '"^ 
2d 



(1/16)^/^^(15/16)^^/^'' 



-2d 



> 



□ 



Proof of Theorem \4-l\ Let C be the constant in Lemma |4.4I Fix e G (0,1/10) such that ed > 
IQ"^^/^ — 1^ and an integer ed > k > 10^°/^. Define a distribution r on Fg x F2 as follows: pick a 
pair {x,y) G F2 x F2 according to the measure n x /x^, pick j G {1, . . . , fc} uniformly at random, 
and consider the random pair (x, {x) + y). This induces a probability distribution r on X F^. 
Observe that 

ED(x, S^{x) + y)< 2p(0, y) + 2j < 2p(0, y) + 2ed. 



Thus 



ED(x,y)(ir(x,y) 



1 



Ff xF| 



ED(x,5^'(x) +y)dT(x,?/) 



< 2e(i + 2^ ('^ye'^(l-e)'^-'^ = 4ed. 



(11) 



Lemma 14.41 implies that for every ^4 C Fg, 



,0 



(Vlog(l/£)loglog(l/£)) 



T({(x,y) GF^xF^ |{x,y}n^| =1}) 



> 2/i(A)[l - /i(A)] -3-2" 



Vlog(l/£)loglog(l/£) 
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Thus, the conditions of Lemma 14.21 hold true with the parameters 5 = 3-2 \/iog(i/£) log iog(i/£) 

•e) lo 



a = ^ . Hence by and Lemma BTol 



ci(F^,ED) > , , . 8° ^ -. 

20(Vlog(l/e)loglog(l/£)) 4e(i 

This impUes the required result when we choose e « logd' '— ' 

5 Flat tori which do not embed into Li 

Let A C M" be a lattice in R" of rank n. The quotient space M"/A is a Riemannian manifold 
(n-dimensional torus) whose curvature is identically zero. Nevertheless, we show here that it is 
possible to construct lattices A such that ci(M"/A) = For a lattice A C M" we denote its 

fundamental parallelepiped by Pa- The dual lattice of A, denoted A*, is defined by 

A* = {x G M" : Vy G A, {x,y) G Z}. 

We shall use the following notation 

A^(A) = min llxllo and r( A) = max min llx — ■yllo. 

xeA\{0} xm" y€A 

In words, A^(A) is the length of the shortest vector in A, and ^(A) is the smallest r such that balls 
of radius r centered at lattice points cover M". 

Theorem 5.1. Lei A C be a lattice. Then 

Corollary 5.2. Let A be a lattice such that A* is almost perfect, i.e. N{A*) = 1 and r(A*) < 4, say. 
Such lattices are well known to exists (see \61\ Then Theorem, 15. 1\ implies that ci(M"'/A) = 

Q{^/n). This is, in particular, an example of a Riemannian manifold whose curvature is identically 
zero which does not well-embed bi-Lipschitzly into £2. This fact should be contrasted with the Nash 
embedding theorem which says that any n-dimensional Riemannian manifold is isometric to 
a Riemannian submanifold ofM?'^. 

Remark 5.1. Some restrictions on the Lattice A should be imposed in order to obtain a torus M^/A 
which does not embed into £2. Indeed, the mapping / : R"/Z" — > C" defined by /(xi, . . . ,x„) = 
^^2ttixi ^ . . . , e^'^*^") has distortion 0(1). We leave open the interesting problem of determining the 
value of ci(M"/A) and C2(M'^/A) as a function of intrinsic geometric parameters of the lattice A. In 
Theorem 15.81 below we show that for every n 

Ln := sup{c2(M"/A) : A C M" is a lattice} < 00. 

Corollarv 15.21 shows that L„ = ^}{y/n), while the upper bound obtained in Theorem 15.81 is -L„ = 
0(n^"/^). It would be of great interest to close the large gap between these bounds. 
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The proof of Theorem 15.11 will be broken down into a few lemmas. In what follows we fix a 
lattice A C and denote by m the normalized Riemannian volume measure on the torus M"/A. 
Given a function / : R^/A — > Li we also think of / as an A- invariant function defined on M". We 
refer to l59] for the necessary background on Fourier analysis on tori used in the ensuing arguments. 

Lemma 5.3. Let 7 denote the standard Gaussian measure on R", i.e. d'y{x) = (27r)"/2 e'l^^Hz/^. 
Then for every continuous f : R"/A Li, 

11/(2;) - f{y)\\idm{x)dm{y) 

J"/A)x(R"/A) 

^ --. zi^TI^ / / \\f{x) - f{x + y)\\idm{x)dj{y). 



Proof. By integration it is clearly enough to deal with the case of real-valued functions, i.e. / : 
R^/A R. Moreover, we claim that it suffices to prove the required inequality when / takes values 
in {0, 1}. Indeed, assuming the case of / : R^/A — > {0, 1}, we pass to the general case as follows: 

\f{x) - f{y)\dm{x)dm{y) 

(IR/A)x(M"/A) 



VA)x(R"/A) 

< 

- 1 - e-^ 

1 



\^{~oo,t]{fix)) - l(-oo,i](/(y))MM dm{x)dm{y) 
T^^e^^^^W^ Ln A (/ \'^(-'°°A(f(^)) - ^i~oo,t]{f{x + y))\dt^ dm{x)dj{y) 



"/A 

- /(x + y)\dm(x)d'y{y). 



l-e-2-^[^(A*F VA-ZA 

So, it remains to prove the required inequality for a measurable function / : M"/A {0, 1}. The 
function / can be decomposed into a Fourier series indexed by the dual lattice A*: 

fiy) = E 

xeA* 

where 

fix) = [ f{y)e-^^'^-^yUm{y). 

Using the fact that \ f{x) — f{x + y)\ = f{x) + f{x + y) — 2f{x)f{x + y) we get from Parseval's 
identity that for every y G R*^ 

\f{x)-f{x + y)\dm{x) = 2/(0) -2/ ( V /(n)/(t;)e2-««'^)+<'^'^+?^)) 1 dm(x) 

= 2/(0) -2 E e2->'^)|/H|2. 

uieA* 

Integrating with respect to the Gaussian measure, and using the identity Jj^„ e^'^*^"''^^(i7(y) = 
g-27r2i|u,l|2^ we get that 



/ / |/(x)-/(x + y)|(im(x)d7(y)=2/(0)[l-/(0)]-2 J] e-2-'IHIi|/(^)|2. (12) 
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On the other hand, since / is Boolean function we have the identities: 



/ |/(x)-/(y)|dm(x)dm(y) = 2/(0)[l-/(0)] = 2 V \f{w)\^. (13) 

J(RVA)x(RVA) ^eA*\{0} 



Combining ()12|) and H13|) we get 



/ / \f{x)-f{x + y)\dm{x)dj{y) = 2 V (l - g-^-'Hli 

> 2 (l - e-2-^[^(^*)l^) I/HI' 

weA*\{o} 

= (l-e-2-'[^(^*)]') / |/(x)-/(y)|dm(x)dm(y). 
^ ^ J(R"/A)x(M"/A) 



Lemma 5.4. For ewery lattice A C M", 

/ dj^n/j^{x,y)dm{x)dm{y) _ 

y(R"/A)x(R"/A) ibr(A 

Proof. Let Va be the Voronoi cell of A centered at 0, i.e. 

VA = {xeR'' : \\x\\2 = d{x,A)}. 



I/HP 



□ 



n 

> 



Denote by B2 the unit Euclidean ball of centered at 0. Then by the definition of r(A*) we have 
that Fa* C r{A*)B^. Hence vo^Fa.) < [r(A*)]"vol(-B^). It is well known (see UHlllZlllll) that 

vol(yA) • vol(yA*) = voI(Pa) • vo1(Pa*) = 1. 

Thus 

voI(Fa) > 



[r(A*)]"vol(fi^) 
It follows that 

/ d^r./x{x,y)dm{x)dm{y) = } [ \\x\\2dx 

J(R"/A)x(R"/A) VOII^VaJ Jva 

vo1({xG^a: IN|2>8;:^}) 



n 

> 



8r(A*) vol(Vy 



A 



n I n 



n 

- 16r(A*)' 



□ 
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Proof of Theorem \5.1\ If / : M^/A Li is bi-Lipschitz then 



\\f{x) - f{x + y)\\idm{x)dj{y) < ||/||Lip • / / dKn/f,{x,x + y)dm{x)dj{y) 

"/A JM" JM"/A 



< ll/llLip- / hhdjiy) 

< \\f\\up-V^. 
On the other hand, using Lemma 15.41 we see that 

\\f{x) - f{y)\\idm{x)dm{y) > — -i- / d^^ jjy{x,y)dm{x)dm{y) 

r/A)x(R"/A) 11/ IlLip J(M"/A)x(M"/A) 

1 n 

> 



||/-i||Lip 16r(A*) 
so by Lemma we deduce that 

'l _e-2^'[^(A*)P 



iLip • 11/ ^IlLip = ^ 

It follows that for every t > 0, 



r(A* 



/l_ -2^2[7V({tA)*)]2 \ ^_ -2^^[N(h*r/t^ \ 

c.(M°/A) = c.(R-/(iA))=»^ .((tA)-) KA^IA 

Optimizing over t yields the required result. □ 

If one is interested only in bounding the Euclidean distortion of ]R"/A, then the following lemma 
gives an alternative proof of Theorem 15. II (in the case of embeddings into L2). 

Lemma 5.5. For every continuous f : M"/A — > L2, 

[ Wfix) - f{y)gdm{x)dm{y) < ^ / \\V f {x)gdm{x) . 

J{R"/A)x(R"/A) [Af(A*)J JR"/A 

Proof. By Parseval's identity 



\\Vfix)gdmix) 

'VA 



^ f f ^ f \ ^ 



xeA 



a; 



|2 
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2,'eA*\{o} 

i*m2 / II ?/nMl2 



[iV(A*)]^ / \\f{x)-fmidm{x) 

JIR"/A 

t^^'^*^^' ^ ||/(x)-/(y)||2dm(x)dm(y). 

/A)x{M"/A) 



□ 
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Lemma [^yields the lower bound C2(M"/A) = Q (^^^^ ■ as follows. Let / : M"/A L2 
be a bi-Lipschitz function. Since L2 has the Radon-Nikodym property, / is differentiable almost 
everywhere (see |1U|'). Now, by Lemma 15.51 



/ 

J a 



:"/A)x(K"7A) 



- f{y)\\ldm{x)dm{y) < 



t V 



f 


Of 




dxj 



dm{x) 



< 



[N{K*)\ 

On the other hand, arguing as in the proof of Theorem 15. IL we get 



n 



2 

Lip' 



/ 



'/A)x{R"/A) 



\\f{x) - f{.y)\\ldm{x)dm{y) > 



1||2 /, 
T.in JfM" 



It follows that 



C2(M"/A) = 



11/ IlLip 
1 

11/ "^IIlip 

/iV(A*) 
V r(A*) ■ 



/A)x(R"/A) 



dRn/x{x,yfdm{x)dm{y) 



The following corollary of Lemma 15.51 will not be used in the sequel, but we record it here for 
future reference. 

Corollary 5.6. For every continuous f : W^/A — > R, 

/ - f{y)\dm{x)dm{y) < |^ / \\V f{x)hdm{x). 

J{R"/A)x(R"/A) ^^V^)JR"/A 

Proof. Lemma E3] implies that Ai(M"/A) > [iV(A*)] , where Ai(M"/A) is the smallest nonzero 
eigenvalue of the Laplace-Beltrami operator on M^/A. Since R^/A has curvature 0, an inequality 
of Buser [19; implies that Ai(R"/A) < 10[^(R"/A)]^ where h{W/A) is the Cheeger constant 
of R^/A (Buser 's inequality can be viewed as a reverse Cheeger inequality ^2! when the Ricci 
curvature is bounded from below). Thus /i(M"/A) > A^(A*)/\/To, which is precisely the required 
inequality. □ 



We end this section by showing that there exists a constant Dn < 00 such that for any rank n 
lattice A C R", C2(R'"/A) < D^- 

Lemma 5.7. Every rank n lattice A C M" has a basis (over 11^) xi, . . . ,a:„ such that for every 
ui,U2, . . . ,n„ G R, 



1/2 



n 



(3n-l)/2 



^UjWxjWl 



< 



1/2 



(14) 
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Proof. Let {xi, . . . , Xn} be a basis of A, and denote by A the matrix whose columns are the vectors 
II ^^11 , . . . , I I ^"ii . If we let ixi, . . . be the Korkin-Zolotarev basis of A, we can ensure that 

||a;i||2' ' |f„||2 ' "J ' 

(see ISni): 

Idet(^)l>;^- 

Denote by si{A) > S2{A) > ■ ■ ■ Sn{A) > the singular values of A. Given a vector u = 
(ui, . . . ,Un) G M" we have by the Cauchy- Schwartz inequality that 



\Au\ 



2 



— ^ l^il — ■ ll^lb- 

2 



This proves the right-hand side of (|14j) . and also shows that si{A) < ^Jn. Now 

n 

— < I det(A)| = n ^i(^) < si(^) • \sn{A)f-^ < si{A) • n("-i)/2^ 

i.e. si(^) > n-(3"-i)/2. It follows that for every u € i?'', Pti||2 > si(^)||n||2 > n-(3"-i)/2||u||2, 
which is precisely the left-hand side of ()14|). □ 

Theorem 5.8. Let A C be a lattice of rank n. Then R"/A embeds into M^" with distortion 
Proof. Let {xi, . . . , Xn} be a basis as in Lemma E7| Define / : M" ^ C" by 

/ (^E^J-^^j = (lkill2e'"'"S---,lkn||2e2--"). 

Since / is A-invariant, we may think of it a a function defined on the torus ]R"/A. For every t E M 
let m{t) be the unique integer such that t — m{t) £ [—1/2, 1/2). Given u,v € M", 



l-^J Il2 



2 [1 — cos(27r(nj — Vj))] ■ ||xj||2. 
i=i 



Since for every t S M, 
we get that 



[t - m{t)]^ [t - m{t)]^ 

< 1 — cosizvrr) < . 

12 - V ; - 2 ■ 
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On the other hand, by ((Til) . 

(n n 
i=i j=i 



< 



^{uj - Vj)xj - m{uj - Vj)xj 



1/2 



In the reverse direction, let mi, . . . , m„ S Z be such that Yl^=i ''^j^j S A is a closest lattice point 
to n — t;. Then 



VjXj 



- Vj - mj]x^ 



> 



> 



n 



l2 II |||2 

Vj-nij] ■ \\xj\\\2 



n 



1^ I^ek- 



1/2 



It follows that / has distortion 0(n^"'/^). 



□ 



6 Length of metric spaces 

The following definition, due to G. Schechtman |58j . plays an important role in the study of the 
concentration of measure phenomenon and Levy families [481 137| . 

Definition 6.1. Let {X,d) be a finite metric space. The length of{X,d), denoted i{X , d) is the least 
constant £ such that there exists a sequence of partitions of X, P^,P^, . . . , P-^ with the following 
properties: 

1. For every i>l, P' is a refinement of P^^^ . 

2. P° = {X} and P^ = {{x} : x £ X}. 

3. For every i > I there exists Oj > such that if A £ p«-i q,^^ B,C G are such that 
B,C A, then there exists a one-to-one onto function (p = <j)B,c ■ B ^ C such that for every 
X £ B, d{x,(/){x)) < Oj. 



4. i = x/El,o^- 



22 



For p > 1 we can can define an analogous concept if we demand tliat i = {^f^i . In tliis 

case we call the parameter obtained the tp length of (X, d), and denote it by (.p{X^d). Observe 
that it is always the case that lp{X, d) < diam(X). 

Recall that for p G [1,2], a Banach space Y is called p-smooth with constant S if for every 

||x + yfy + ||x - yfy < 2\\xfy + 2S^yfY. 

The least constant S for which this inequality holds is called the p-smoothness constant of Y, and 
is denoted Sp{Y). It is known gj that for q>2, S-^iLq) < ^/q^, and for q G [1,2], Sq{Lq) < 1. 
The following theorem relates the notion of length to nonembeddability results. 

Theorem 6.2. Let (X, d) he a metric space and Y a p-smooth Banach space. Then 

i/p 



CY{X,d)>-^^ 

In particular for 2 < p < oo, 

Cp{X,d) > 



1 



/p-Sp{Y)ep{x,d) \ \x 



\ VP 



1 



\ 1/2 

-2 ^ d{x,yf 

I x,y&X J 



i{X, d)^2p-2 \ \X 



Proof. Let {-P' j^O' {'^i}iLi be as above, and denote by J^i the cr-algebra generated by the partition 
P*. In what follows all expectations are taken with respect to the uniform probability measure on 
X. Given a bijection / : X — > y we let /j = E,{f\J^i). In other words, if ^ G P* and x £ A then 



Now {/ili^o ^ martingale, so by Pisier's inequality ^SSj (see Theorem 4.2 in 50 for the constant 
we use below), we see that 



Af-l 



3=0 



Now /o = E/ and fN = f- Thus 



Ell/iv - M'y 



X ^ 



> 



1 



/(^) - M ^ ^^y^ 



yex 



Y 



2P-HX|2 



x,yeX 



\x\ 

'Lip ' ' x,y(^X 
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On the other hand fix j e {0, . . . , iV - 1}, and A e , B e P^+^ such that x e B <Z A. Then 

fjix) - = ^ E /(y) - M ^ ^^^^ = ^ ^ I - /(y)] I • 

' ' y€A ' ' yeB ' ' \j/gC / 

So by convexity 

- fj+lix)\\Y < WfWup ■ fflj+l- 

It follows that 



1 /^p^Ex,s/exf^(^.yF^ 



1/p 



■ Sp{Y)ep{X,d) \ \x\ 



□ 



As shown in [^EZj, if we consider the group of permutations of {1, . . . , n}, S'„, equipped with 
the metric d{a,7r) = \{i : a{i) / T(i)}|, then £{Sn,d) < 2-y/n, while diam(S'n) = 0(n). It follows 
from Theorem lf).2l that C2(5'„) = ^l{^/n). On the other hand, by mapping each permutation n G Sn 
to the matrix we see that C2{Sn) = 0{y/n). Thus 



C2{Sn) = e 



log 1 5, 



n 



log log I Sr, 



Similar optimal bounds can be deduced for Cp{Sn)-, p > 1- 

The metric d on Sn is equivalent to the shortest path metric induced by the Cayley graph on 
Sn obtained by taking the set of all transpositions as generators. It is of interest to study the 
Euclidean distortion of metrics on Sn induced by Cayley graphs coming from other generating sets. 
Recently in [31] it was shown that there exists a bounded set of generators of Sn with respect to 
which the Cayley graph is an expander (this settles a long standing conjecture- see j57jl. It follows 
that there exists a set of generators of Sn with respect to which the metric induced by the Cayley 
graph has Euclidean distortion r2(log \ Sn\) = r2(lognloglogn). 

Another example discussed in [481 137j is the case of the Hamming cube. In this case l(¥2, p) = 
0{Vd), and so Theorem l6. 21 implies that for p > 2, Cp(F2,p) > c{p)y/d. This result was first proved 
in pj. 

More generally, let G be a finite group equipped with a translation invariant metric d. Let 
G = Go 3 Gi 5 • • • 5 Gn = {e} be a decreasing sequence of subgroups. Then it is shown 
in dBl EH that 



i{G,d) < 



E [diam(G,_i/Gi)]'. 
This estimate implies a wide range of additional nonembeddability results. 
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7 Appendix: Quantitative estimates in Bourgain's noise sensitiv- 
ity theorem 

In this section we prove theorem 14.31 The proof is a repetition of Bourgain's proof in with an 
optimization of the dependence on the various parameters. Since such quantitative bounds are very 
useful, and they are not stated in JH]) we beheve that it is worthwhile to reproduce the argument 
here. 

Theorem 7.1 (Bourgain's distributional inequality on the Fourier spectrum of Boolean 
functions). Let / : ^ {0, 1} be a Boolean function. For 2 < k < d and [3 € (0, 1) define 



JFt= { 



AC{l,...,d} 
\A\<k, jeA 



Then 



where 



AC{l,...,d} 

\A\<k, A\Jp^% 



5 = /(^)' 



Ac{i,...,4 

|A|>fe 



and C is a universal constant. 



The quantitative version of Bourgain's noise sensitivity theorem, as stated in Theorem 14.31 
follows from Theorem 17. 11 Indeed, we are assuming that / : ^ {-1, 1} satisfies 

Y {i-erw>i-5. 

AC{l,...,d} 

By Parseval's identity, ^^c{i d) /(^)^ ~ 1- Thus 

i-5< Y /(^)' + ^ E E + ^ E /(^)'- 

AQ{l,...,d} AQ{1,...4} AQ{l,...,d} AQ{l,...,d} 

\A\<l/£ \A\>l/e \A\>l/e \A\>l/e 

It follows that 

E /(^)' ^ 

AC{l,...,d} 

\A\>l/e 

Now, choosing k = 1/e Theorem 17. 11 we get that 

(i\M<Y E nAf= Y \A\KAf<\. 

j=l AC{l,...,d} AC{l,...,d} 
\A\<l/e, jdA \A\<l/e 



25 



i.e. I J/3 1 < Thus, if we define 5 : F2 ^ M by 



then g depends on at most ^ coordinates. Moreover, by Theorem 17.11 apphed to the Boolean 
function (1 + /)/2, we get that 



[f{x)-g{x)fdix{x) 



AC{l,...,d} 



AC{l,...,d} 

|A|>l/e 



AC{l,...,(i} 

\A\<l/e, A\J^/,^% 



< 25 + A- C7Vi°S2(2/5) iog2 iog2 k . ^ 4fcy^' 



as required. 



In the proof of Theorem 17.11 we will use the following well known fact: For every / : F 
with f(x)diJ,{x) = 0, and every p G [1, 2], 



(15) 



This is true since by the Bonami-Beckner inequality |121 [5] , 



i/p 



\fix)\Pdi,{x) 



J2 {p-l)-f{A)WA{ 

AC{l,...,d} 

^ 1/2 

E ip-^r^m' 

AC{l,...,d} 

> v^-fE/(wn ■ 



2x 1/2 



Lemma 7.2. Fix t,6,(3 G (0, 1) and p G (1, 2). Lei /, J 6e too disjoint finite sets and / : F2 x F"^ 
{0, 1} a Boolean function. Assume that for every i G I, 

E /(^)' ^ 



AC/UJ 
|A|<fe, ieA 



and 



E 



ACIUJ 
\A\>k 
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Then 



y: fiA?< 



2t 



AC/UJ 

\Ani\=i 



\Ar\I\<k 



Proof. Fix t G (0, 1) which wih be specified later. Let {sj}jg/ be i.i.d. {0, 1} valued random 
variables with mean \ — t. Let S" C / be the random subset S = {i ^ I : Sj = 1}. Fix y G and 
for A C I denote 

Jy{A) = [ f{x,y)WA{x)dfi{x) = Y f{AUB)WB{y). 



Define c/^ : ^ R by 

9yix) = Yl fyi^WAix) = E E KAU B)WA{x)WBiy) = / . 



AC/ 
AgS 



AC/ BCJ 
AgS 



Then 



2 / b,(x)]2d/i(x) = / \gy(, 



x)\dfi{x). 



(16) 



To check this identity observe that if -0 : Fg — > {0, 1} is a Boolean function, with f^d ijj{x)d^{x) = P, 
then 2 -P)2d/i(x) = |V'(x) - P|d^(x) = 2P(1 -P). Thus ^ follows by fixing (xj)jes 
and {yj)j^j, applying this observation to the Boolean function {xj)j^j\s f(x,y), and then 
integrating with respect to {xj)j^s- 

Using (fT3|l . Holder's inequality, and (fTB|l. we get that 



1/2 



.iG/\5 



2p-2 



2p' 



I <2p 



-1 



1/p 



E fy^^y^ 



AC/ 

\a2S 



In other words. 



E(i--.)/.(W)^) ^ " (^^S('"M'')^'^^^' 



(17) 



Observe that t = — (1 — t) + 1 — Sj, so that, since p < 2, 



p/2 



- S^)fyi{^})' 



> *^/^E/^(«)' 

is/ 



p/2 

'Yis^-{l-t)))fy{{i}f 

p/2 

YiSi - {I - t)))fyi{i} f 



p/2 



(18) 
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Combining ((T7|) and ((TH|) . taking expectation (with respect to {sjjjg/), and integrating with respect 
to y E F2 , we get that 



AC/UJ 

|^n/|=i 



AC/ 



E 



j2{s^ - {I - mfy{{i}y' 



p/2 



dfi{y). 



(19) 



To estimate the second summand in (|19j) observe that 

p/2 / 



E 



5](s.-(l-t))/,({i})^ 



dfiiy) < E 



< 2E 



Y^{s,-{l-t))fy{{^}f 



d/i(y) J (20) 



^/^(y)] '(21) 



where in we used Jensen's inequality, and in (|21j) we used the fact that if are 
independent mean random variables then E \Xi + • • • + X„| < 2Ey^ X'^ + • • • + X^, which can be 
proved using the following standard symmetrization argument. Let Yi,...,!^ be i.i.d. copies of 
Xi, . . . , Xn, and let ei, . . . , e„ be i.i.d. independent ±1 Bernoulli random variables which are also 
independent of the {Xj}"^^^ and {Yj}'^^^. Then 





n 




n 




n 




n. 


E 




= E 


Y^iX, - EY,) 


< E 




= E 






i=i 




i=i 











< E 



A 



E, 



"^ej^Xj - Yj 



E. 



E(^. - ^.)^ < 2E^ 



Now, from the inequality 



fym) 

we get that 



mWAiy) 



AC/UJ 

Ani={i} 



< 



Y mWAiy) 



AC/UJ, |A|<A; 
An/={i} 



+ 



/(^)W^A(y) 



AC/UJ, |A|>fc 
An/={i} 



5^(1 - s.)fyi{i}Y 



1/2 


( ( 


) ^ ' 


E 




I'" I 







X 4x 1/2 



^E 

ie/ 



J] /(^)W^A(y) 



AC/UJ, |A|<fe 
Ani={i} 



+ 



Y mWAiy) 



AC/UJ, |A|>fc 
, An/={i} 



(1-s^). 
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Thus, by Parseval's indentity and the Bonami-Beckner inequahty we deduce that 



/ \ 1/2 



< 4 



E 

i&I 



. 4x 1/2 



ACIUJ, \A\<k 
. Anl={i} 



ACIUJ 
\A\>k 



< 4 



E 



6/ 



1/2 



J2 mWAiy) 



. ACIUJ, \A\<k 
\ Ani={i} 



dn{y) 



( 



( 



\ 



I 

2\ 1/2 



+ 4t(5 



AC/UJ, |A|<fc 
, An/={i} 



+ 4t(5 



< 4-3'^ 



1/2 



1/2 



max 



E /(^)^ 



V 



AC/UJ, |A|<fc 
An/={i} 



E E /(^)^ 

ie/ AC/UJ, |A|<fc 

An/={i} 



+ 4M 



< 4-3''v^ + 4t5. 
Combining (|22l) and ((2ll) with (UHl), we see that 



AC/UJ 

|An/|=i 



8t5 



p/2 



AC/ 



< 



AC/UJ 
|An/|<A; 



{p - l)P/2 



(22) 



□ 



Proof of Theorem \ 7. 1\ For every integer t > define 



E nA? 



We also write 



AC{l,...,d} 

2'"-i<|A\J^|<2'- 



s= E 

AC{l,...,d} 
|A|>fc 
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Fix < r < log2 k, which will be chosen presently. Let I be a uniformly random subset of 



a 



{1, . . . ,4 \ J/3 of size l-'-^id - \ Jfs\). li A C {1, ... ,4 satisfies l""^^ < \A \ Jp\ < 2^ then 
standard counting argument shows that Pr[|A n /| = 1] > C and H /| < C , where C, C are 
universal constants. Observe also that by the definition of J/3, the sets I, J = {1, . . . , d} \ / satisfy 
the conditions of Lemma l7.2l Taking expectation with respect to / of the conclusion of lemma E21 
we get that there exists a universal constant c > such that 

"'""'^^fFh^^ S 2V.+ (^+(3^v/]5r + («f'^ (23) 

Define 

AC{l,...,d} s<log2fc 
|A|<fc, A\J^^0 

Our goal is to prove that for some constant C > 1, 

^ < CVl°g2{2/5) logalogafc . (^^^ ^ _ (24) 

If 

2>s > -i^/k 



then choose r < log2 k to satisfy 



s<log2 k 



2V>^ r E (25) 

log2 /C , 
s<log2 fc 



In particular it follows that pr > ^^^^ ^ . Choose p = 2 — 2^ ^j^^^^^^. We may assume that 
p G (3/2,2), since otherwise (|^H) holds vacuously. Moreover, if 5^/^ > —77^7^ — r then ([2i|l holds 

/ \2/(2-p) 

true. Thus, choosing t = I tttt^ — -r I in (|23j) . and using (|25|) . we obtain (|24|) in this case. 



lOlogafc^ 

It remains to deal with the case Ss<iog2 ^^/^^ ^ 7\/A;. Choosing r such that > jj^^j^, and 
~ logl, fc ' ^ ~ fc(iogfc)"t ' shows that PH) holds true in this case as well. □ 
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